No d ’ ordre : 2012 - ISAL - 0094 Année 2012 THÈSE
نویسندگان
چکیده
Pattern discovery in large binary relations has been extensively studied. Typically, it needs to compute patterns that hold in relations Objects×Properties that denote whether given properties are satisfied or not by given objects. An emblematic success in this area concerns frequent itemset mining and its post-processing that derives association rules. It is however clear that many datasets correspond to n-ary relations where n > 2. For example, adding spatial and/or temporal dimensions (location and/or time when the properties are satisfied by the objects) leads to the 4-ary relation Objects× Properties× Places× Times. Therefore, we study the generalization of association rule mining within arbitrary n-ary relations: the datasets are now Boolean tensors and not only Boolean matrices. Unlike standard rules that involve subsets of only one domain of the relation, in our setting, the head and the body of a rule can include arbitrary subsets of some selected domains. A significant contribution of this thesis concerns the design of interestingness measures for such generalized rules: besides a frequency measures, two different views on rule confidence are considered. The concept of non-redundant rules and the efficient extraction of the non-redundant rules satisfying the minimal frequency and minimal confidence constraints are also studied. To increase the subjective interestingness of rules, we then introduce disjunctions in their heads. It requires to redefine the interestingness measures again and to revisit the redundancy issues. Finally, we apply our new rule discovery techniques to dynamic relational graph analysis. Such graphs can be encoded into n-ary relations (n ≥ 3). Our use case concerns bicycle renting in the Vélo’v system (self-service bicycle renting in Lyon). It illustrates the added-value of some rules that can be computed thanks to our software prototypes.
منابع مشابه
N ° d ’ ordre : ANNÉE 2012 THÈSE / UNIVERSITÉ DE RENNES 1
The aim of the study was to assess the feasibility of CT based 3D analysis of degenerated aortic bioprostheses to make easier their morphological assessment. This could be helpful during regular follow-up and for case selection, improved planning and mapping of valve-in-valve procedure. The challenge was represented by leaflets enhancement because of highly noised CT images. Contrast-enhanced E...
متن کاملIntroduction de fonctionnalités d'auto-optimisation dans une architecture de selfbenchmarking. (Introduction of self-optimization features in a self-benchmarking architecture)
Thèse dirigée par Patrice MOREAUX et codirigée par Bruno DILLENSEGER préparée au sein du Laboratoire LISTIC dans l'École Doctorale SISEO Introduction de fonctionnalités d'auto-optimisation dans une architecture de selfbenchmarking Thèse soutenue publiquement le 25 septembre 2012 devant le jury composé de : Remerciements Lorsque vous résolvez unprobì eme, vous devriez remercier Dieu et passer au...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012